Near-synonym Lexical Choice in Latent Semantic Space

نویسندگان

  • Tong Wang
  • Graeme Hirst
چکیده

We explore the near-synonym lexical choice problem using a novel representation of near-synonyms and their contexts in the latent semantic space. In contrast to traditional latent semantic analysis (LSA), our model is built on the lexical level of co-occurrence, which has been empirically proven to be effective in providing higher dimensional information on the subtle differences among near-synonyms. By employing supervised learning on the latent features, our system achieves an accuracy of 74.5% in a “fill-in-the-blank” task. The improvement over the current state-of-the-art is statistically significant. We also formalize the notion of subtlety through its relation to semantic space dimensionality. Using this formalization and our learning models, several of our intuitions about subtlety, dimensionality, and context are quantified and empirically tested.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Contextual Influences on Near-Synonym Choice

One of the least-understood aspects of lexical choice in Natural Language Generation is choosing between near-synonyms. Previous studies of this issue, such as Edmonds and Hirst [4], have focused on semantic differences between near-synonyms, as analysed by lexicographers. Our empirical analysis of near-synonym choice in weather forecasts, however, suggests that other factors are probably more ...

متن کامل

A corpus-based evaluation method for Distributional Semantic Models

Evaluation methods for Distributional Semantic Models typically rely on behaviorally derived gold standards. These methods are difficult to deploy in languages with scarce linguistic/behavioral resources. We introduce a corpus-based measure that evaluates the stability of the lexical semantic similarity space using a pseudo-synonym same-different detection task and no external resources. We sho...

متن کامل

A corpus-based evaluation method for Distributional Semantic Models

Evaluation methods for Distributional Semantic Models typically rely on behaviorally derived gold standards. These methods are difficult to deploy in languages with scarce linguistic/behavioral resources. We introduce a corpus-based measure that evaluates the stability of the lexical semantic similarity space using a pseudo-synonym same-different detection task and no external resources. We sho...

متن کامل

Semantic Representations of Near-Synonyms for Automatic Lexical Choice

Semantic Representations of Near-Synonyms for Automatic Lexical Choice Philip Edmonds Doctor of Philosophy Graduate Department of Computer Science University of Toronto 1999 We develop a new computational model for representing the fine-grained meanings of nearsynonyms and the differences between them. We also develop a sophisticated lexical-choice process that can decide which of several near-...

متن کامل

Word Type Effects on L2 Word Retrieval and Learning: Homonym versus Synonym Vocabulary Instruction

The purpose of this study was twofold: (a) to assess the retention of two word types (synonyms and homonyms) in the short term memory, and (b) to investigate the effect of these word types on word learning by asking learners to learn their Persian meanings. A total of 73 Iranian language learners studying English translation participated in the study. For the first purpose, 36 freshmen from an ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010